Understanding structure of concurrent actions

Author: B Rosman
D Silver
H Wang
RS Sutton
U Luxburg
Publication venue
Publication date: 17/12/2019
Field of study

Whereas most work in reinforcement learning (RL) ignores the structure or relationships between actions, in this paper we show that exploiting structure in the action space can improve sample efficiency during exploration. To show this, we focus on concurrent action spaces, where the RL agent selects multiple actions per timestep. Concurrent action spaces are challenging to learn in, especially when the number of actions is large, because this can lead to a combinatorial explosion of the action space. This paper proposes two methods: the first uses implicit structure to perform high-level action elimination using task-invariant actions; the second looks for more explicit structure in the form of action clusters. Both methods are context-free, focusing only on an analysis of the action space, and both show a significant improvement in policy convergence times.
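The combinatorial explosion mentioned in this abstract can be illustrated with a short sketch. It assumes, purely for illustration, that a joint action is any non-empty subset of the primitive actions; the function name `joint_actions` and the pruning step are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def joint_actions(primitive_actions):
    """Enumerate every non-empty subset of primitive actions.

    Assumption (not from the paper): the agent may fire any
    non-empty combination of primitive actions in one timestep.
    """
    actions = list(primitive_actions)
    result = []
    for size in range(1, len(actions) + 1):
        result.extend(combinations(actions, size))
    return result


# With n primitive actions there are 2**n - 1 joint actions.
full_space = joint_actions(range(10))

# Hypothetical pruning: dropping two primitive actions up front
# shrinks the joint space by roughly a factor of four.
pruned_space = joint_actions(a for a in range(10) if a not in {3, 7})

print(len(full_space), len(pruned_space))
```

Removing even a few primitive actions before enumeration shrinks the joint space sharply, which is the general intuition behind eliminating actions early.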

Central Archive at the University of Reading

Crossref

True Neutrality as a New Type of Flavour

Author: A Giveon
A Salam
B Altschul
B Pontecorvo
BS Yuldashev
D Colladay
E Majorana
EJ Konopinsky
J Bell
JD Taylor
LD Landau
LM Slad’
MA Markov
MN Rosenbluth
OW Greenberg
Rasulkhozha S. Sharafiddinov
RB Begzhanov
RS Sharafiddinov
S Weinberg
SL Adler
SL Glashow
T Prokopec
TD Lee
W Wang
W-M Yao
Y Fukuda
YaB Zel’dovich
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 17/08/2016
Field of study

A classification of leptonic currents with respect to C-operation requires the separation of elementary particles into the two classes of vector C-even and axial-vector C-odd character. Their nature has been created so that to each type of lepton corresponds a kind of neutrino. Such pairs are united in families of a different C-parity. Unlike the neutrino of a vector type, any C-noninvariant Dirac neutrino must have its Majorana neutrino. They constitute the purely neutrino families. We discuss the nature of a corresponding mechanism responsible for the availability in all types of axial-vector particles of a kind of flavour which distinguishes each of them from others by a true charge characterized by a quantum number conserved in the interactions between the C-odd fermion and the field of emission of the corresponding types of gauge bosons. This regularity expresses the unidenticality of truly neutral neutrino and antineutrino, confirming that an internal symmetry of a C-noninvariant particle is described by an axial-vector space. Thereby, a true flavour together with the earlier known lepton flavour predicts the existence of leptonic strings and their birth in single and double beta decays as a unity of flavour and gauge symmetry laws. Such a unified principle explains the availability of a flavour symmetrical mode of neutrino oscillations. Comment: 19 pages, LaTeX, Published version in IJT

arXiv.org e-Print Archive

CiteSeerX

Crossref

ContextVP: Fully Context-Aware Video Prediction

Author: A Geiger
A Graves
Alex Graves
C Ionescu
P Baldi
RS Sutton
S Hochreiter
X Glorot
Z Wang
Publication venue
Publication date: 09/09/2018
Field of study

Video prediction models based on convolutional networks, recurrent networks, and their combinations often result in blurry predictions. We identify an important contributing factor for imprecise predictions that has not been studied adequately in the literature: blind spots, i.e., lack of access to all relevant past information for accurately predicting the future. To address this issue, we introduce a fully context-aware architecture that captures the entire available past context for each pixel using Parallel Multi-Dimensional LSTM units and aggregates it using blending units. Our model outperforms a strong baseline network of 20 recurrent convolutional layers and yields state-of-the-art performance for next step prediction on three challenging real-world video datasets: Human 3.6M, Caltech Pedestrian, and UCF-101. Moreover, it does so with fewer parameters than several recently proposed models, and does not rely on deep convolutional networks, multi-scale architectures, separation of background and foreground modeling, motion flow learning, or adversarial training. These results highlight that full awareness of past context is of crucial importance for video prediction. Comment: 19 pages. ECCV 2018 oral presentation. Project webpage is at https://wonmin-byeon.github.io/publication/2018-ecc

arXiv.org e-Print Archive

Crossref

Studies of oxide/ZnO near-interfacial defects by photoluminescence and deep level transient spectroscopy

Author: Gu QL
Ling CC
Ong HC
Wang RS
Publication venue: 'AIP Publishing'
Publication date: 01/01/2008
Field of study

The evolution of near-interfacial defects from Al2O3/ZnO and MgO/ZnO upon thermal annealing has been studied by photoluminescence, deep level transient spectroscopy, and secondary ion mass spectroscopy. We find that all the results are strongly connected and that they indicate that Zn outdiffuses from ZnO to the oxide layer during annealing and creates deep level defects near the interfacial region. These defects reduce the band-edge emission and increase the deep level emission at 2.37 eV. Our study shows that the oxide/ZnO interface is relatively fragile and that caution must be taken when making metal-oxide-ZnO based transistors and light emitting diodes. © 2008 American Institute of Physics.

Crossref

HKU Scholars Hub

Metrics with Prescribed Ricci Curvature near the Boundary of a Manifold

Author: A Besse
A Dancer
Artem Pulemotov
B Chow
C Böhm
DM DeTurck
E Delay
HH Wang
J Cao
J DeBlois
JH Eschenburg
JL Kazdan
MT Anderson
R Pina
RS Hamilton
T Aubin
Publication venue
Publication date: 29/03/2013
Field of study

Suppose $M$ is a manifold with boundary. Choose a point $o\in\partial M$. We investigate the prescribed Ricci curvature equation $\operatorname{Ric}(G)=T$ in a neighborhood of $o$ under natural boundary conditions. The unknown $G$ here is a Riemannian metric. The letter $T$ in the right-hand side denotes a (0,2)-tensor. Our main theorems address the questions of the existence and the uniqueness of solutions. We explain, among other things, how these theorems may be used to study rotationally symmetric metrics near the boundary of a solid torus $\mathcal T$. The paper concludes with a brief discussion of the Einstein equation on $\mathcal T$. Comment: 13 pages

arXiv.org e-Print Archive

Crossref

University of Queensland eSpace

Assessing the Potential of Classical Q-learning in General Game Playing

Author: CB Browne
CJCH Watkins
CP Robert
D Silver
H Wang
J Hu
J Méhat
M Genesereth
M Świechowski
RS Sutton
V Mnih
Publication venue
Publication date: 14/10/2018
Field of study

After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interest in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed. For small games, simple classical table-based Q-learning might still be the algorithm of choice. General Game Playing (GGP) provides a good testbed for reinforcement learning to research AGI. Q-learning is one of the canonical reinforcement learning methods, and has been used by (Banerjee & Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex; source code: https://github.com/wh1992v/ggp-rl), to allow comparison to Banerjee et al. We find that Q-learning converges to a high win rate in GGP. For the $\epsilon$-greedy strategy, we propose a first enhancement, the dynamic $\epsilon$ algorithm. In addition, inspired by (Gelly & Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games. Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594
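For readers unfamiliar with the terms in this abstract, the following is a minimal sketch of table-based Q-learning with an $\epsilon$-greedy policy. The linear decay in `decayed_epsilon` is only an assumed stand-in for a changing $\epsilon$; the abstract does not specify the paper's dynamic $\epsilon$ algorithm, and all function names, parameters, and default values here are hypothetical.

```python
import random
from collections import defaultdict


def decayed_epsilon(step, total_steps, start=1.0, end=0.1):
    """Linearly anneal epsilon from start to end (an assumed schedule)."""
    frac = min(max(step / total_steps, 0.0), 1.0)
    return start + frac * (end - start)


def epsilon_greedy(q, state, actions, epsilon, rng=random):
    """Explore with probability epsilon, otherwise pick the best-valued action."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    """Apply the standard one-step Q-learning update to the table q."""
    # Terminal states have no next actions, so their value defaults to 0.
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


# The Q-table maps (state, action) pairs to values, defaulting to 0.0.
q_table = defaultdict(float)
q_update(q_table, "s0", "a", 1.0, "s1", ["a", "b"], alpha=0.5)
greedy_choice = epsilon_greedy(q_table, "s0", ["a", "b"], epsilon=0.0)
```

In a training loop, `decayed_epsilon` would be called once per episode so that exploration dominates early and greedy play dominates late.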

arXiv.org e-Print Archive

Crossref

Leiden University Scholarly Publications

Multiplication and Composition in Weighted Modulation Spaces

Author: B Wang
D Jornet
DG Bhimani
E Cordero
G Bourdaud
H Helson
H Triebel
H-J Schmeisser
J Franke
J Johnsen
J Peetre
J Toft
K Gröchenig
M Sugimoto
PI Lizorkin
RS Strichartz
SM Nikol’skij
T Iwabuchi
VG Maz’ya
Publication venue
Publication date: 31/01/2016
Field of study

We study the existence of the product of two weighted modulation spaces. For this purpose we discuss two different strategies. The simpler one allows transparent proofs in various situations. However, our second method allows a closer look at the associated norm inequalities under restrictions in the Fourier image. This will give us the opportunity to treat the boundedness of composition operators. Comment: 49 pages

arXiv.org e-Print Archive

Crossref

Multi-model SAR image despeckling

Author: Wang C
Wang RS
王程
Publication venue
Publication date: 01/01/2002
Field of study

A multi-model despeckling approach for SAR images is presented. The chi-squared test is used to segment the image into homogeneous and heterogeneous regions. Then, the heterogeneous regions are separated into subregions, each of which consists of the points with the same edge orientation. Homogeneous regions and the separated subregions are despeckled according to their characteristics. Experimental results are reported.

Xiamen University Institutional Repository

Concurrent adaptation to opposing visual displacements during an alternating movement.

Author: A Karniel
B Craske
C Prablanc
CS Harris
EA Franz
GM Jackson
H Mikaelian
J. M. Galea
JC Eliassen
JS Wang
N Wenderoth
O Bock
P Servos
PM Bays
R Osu
R. C. Miall
RC Oldfield
RS Johansson
SP Swinnen
Y Wada
Publication venue
Publication date: 01/01/2006
Field of study

It has been suggested that, during tasks in which subjects are exposed to a visual rotation of cursor feedback, alternating bimanual adaptation to opposing rotations is as rapid as unimanual adaptation to a single rotation (Bock et al. in Exp Brain Res 162:513–519, 2005). However, that experiment did not test strict alternation of the limbs but short alternate blocks of trials. We have therefore tested adaptation under alternate left/right hand movement with opposing rotations. It was clear that the left and right hand, within the alternating conditions, learnt to adapt to the opposing displacements at a similar rate, suggesting that two adaptive states were formed concurrently. We suggest that the separate limbs are used as contextual cues to switch between the relevant adaptive states. However, we found that during online correction the alternating conditions had a significantly slower rate of adaptation in comparison to the unimanual conditions. Control conditions indicate that the results are not directly due to the alternation between limbs or to the constant switching of vision between the two eyes. The negative interference may originate from the requirement to dissociate the visual information of these two alternating displacements to allow online control of the two arms.

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

Improved glucose tolerance in acyl CoA:diacylglycerol acyltransferase 1-null mice is dependent on diet

Author: Arch Jonathan RS
Cawthorne Michael A
Cornick Claire
O'Dowd Jacqueline
Wang Steven JY
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Mice that lack acyl CoA:diacylglycerol acyltransferase (Dgat1(-/-) mice) are reported to have a reduced body fat content and improved glucose tolerance and insulin sensitivity. Studies so far have focussed on male null mice fed a high fat diet and there are few data on heterozygotes. We compared male and female Dgat1(-/-), Dgat1(+/-) and Dgat1(+/+) C57Bl/6 mice fed on either standard chow or a high fat diet. RESULTS: Body fat content was lower in the Dgat1(-/-) than the Dgat1(+/+) mice in both experiments; lean body mass was higher in male Dgat1(-/-) than Dgat1(+/+) mice fed on the high fat diet. Energy intake and expenditure were higher in male Dgat1(-/-) than Dgat1(+/+) mice; these differences were less marked or absent in females. The body fat content of female Dgat1(+/-) mice was intermediate between that of Dgat1(-/-) and Dgat1(+/+) mice, whereas male Dgat1(+/-) mice were similar to or fatter than Dgat1(+/+) mice. Glucose tolerance was improved and plasma insulin reduced in Dgat1(-/-) mice fed on the high fat diet, but not on the chow diet. Both male and female Dgat1(+/-) mice had similar glucose tolerance to Dgat1(+/+) mice. CONCLUSION: These results suggest that although ablation of DGAT1 improves glucose tolerance by preventing obesity in mice fed on a high fat diet, it does not improve glucose tolerance in mice fed on a low fat diet.

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central